Overview

Dataset statistics

Number of variables44
Number of observations998
Missing cells9174
Missing cells (%)20.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory343.2 KiB
Average record size in memory352.1 B

Variable types

Numeric16
Categorical23
Unsupported5

Alerts

TIPO_CULTIVO has constant value ""Constant
DEPARTAMENTO has constant value ""Constant
FECHA_SIEMBRA has a high cardinality: 204 distinct valuesHigh cardinality
FECHA_EMERGENCIA has a high cardinality: 205 distinct valuesHigh cardinality
FECHA_FLORACION has a high cardinality: 212 distinct valuesHigh cardinality
FECHA_COSECHA has a high cardinality: 226 distinct valuesHigh cardinality
NOMBRE_LOTE has a high cardinality: 727 distinct valuesHigh cardinality
NUEVO_MATERIAL_GENETICO has a high cardinality: 82 distinct valuesHigh cardinality
OBSERVACIONES_COSECHA has a high cardinality: 132 distinct valuesHigh cardinality
CULT_ANT is highly imbalanced (51.3%)Imbalance
PROD_COSECHADO is highly imbalanced (63.8%)Imbalance
TIPO_DE_SEMILLA has 998 (100.0%) missing valuesMissing
HABITO_CRECIMIENTO has 998 (100.0%) missing valuesMissing
OBJ_RDT has 176 (17.6%) missing valuesMissing
FECHA_COSECHA has 12 (1.2%) missing valuesMissing
METODO_COSECHA has 12 (1.2%) missing valuesMissing
RDT has 12 (1.2%) missing valuesMissing
PROD_COSECHADO has 12 (1.2%) missing valuesMissing
ORIGEN_SEMILLA has 998 (100.0%) missing valuesMissing
INOCULACION_SEMILLAS has 998 (100.0%) missing valuesMissing
NUEVA_INOCULACION_SEMILLAS has 998 (100.0%) missing valuesMissing
PRODUCTO_USADO has 571 (57.2%) missing valuesMissing
NUEVO_MATERIAL_GENETICO has 712 (71.3%) missing valuesMissing
OTRO_CULT_ANT has 994 (99.6%) missing valuesMissing
RESIEMBRA has 781 (78.3%) missing valuesMissing
CANTIDAD_TOTAL has 50 (5.0%) missing valuesMissing
HUMEDAD has 113 (11.3%) missing valuesMissing
OBSERVACIONES_COSECHA has 703 (70.4%) missing valuesMissing
CANTIDAD_TOTAL is highly skewed (γ1 = 20.58704714)Skewed
OTRO_CULT_ANT is uniformly distributedUniform
ID_EVENTO has unique valuesUnique
ID_LOTE has unique valuesUnique
TIPO_DE_SEMILLA is an unsupported type, check if it needs cleaning or further analysisUnsupported
HABITO_CRECIMIENTO is an unsupported type, check if it needs cleaning or further analysisUnsupported
ORIGEN_SEMILLA is an unsupported type, check if it needs cleaning or further analysisUnsupported
INOCULACION_SEMILLAS is an unsupported type, check if it needs cleaning or further analysisUnsupported
NUEVA_INOCULACION_SEMILLAS is an unsupported type, check if it needs cleaning or further analysisUnsupported
OBJ_RDT has 10 (1.0%) zerosZeros
RESIEMBRA has 123 (12.3%) zerosZeros

Reproduction

Analysis started2023-02-20 21:52:42.871924
Analysis finished2023-02-20 21:53:46.305867
Duration1 minute and 3.43 seconds
Software versionydata-profiling v0.0.dev0
Download configurationconfig.json

Variables

ID_EVENTO
Real number (ℝ)

Distinct998
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3040.2214
Minimum53
Maximum4675
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB

Quantile statistics

Minimum53
5-th percentile813.85
Q12292.25
median3075.5
Q33917.75
95-th percentile4618.3
Maximum4675
Range4622
Interquartile range (IQR)1625.5

Descriptive statistics

Standard deviation1069.9872
Coefficient of variation (CV)0.35194383
Kurtosis-0.081858374
Mean3040.2214
Median Absolute Deviation (MAD)806
Skewness-0.39546436
Sum3034141
Variance1144872.6
MonotonicityStrictly increasing
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
53 1
 
0.1%
3320 1
 
0.1%
3307 1
 
0.1%
3308 1
 
0.1%
3309 1
 
0.1%
3310 1
 
0.1%
3311 1
 
0.1%
3312 1
 
0.1%
3313 1
 
0.1%
3314 1
 
0.1%
Other values (988) 988
99.0%
ValueCountFrequency (%)
53 1
0.1%
54 1
0.1%
56 1
0.1%
57 1
0.1%
273 1
0.1%
282 1
0.1%
283 1
0.1%
284 1
0.1%
286 1
0.1%
287 1
0.1%
ValueCountFrequency (%)
4675 1
0.1%
4674 1
0.1%
4673 1
0.1%
4672 1
0.1%
4671 1
0.1%
4670 1
0.1%
4669 1
0.1%
4668 1
0.1%
4667 1
0.1%
4666 1
0.1%

ID_LOTE
Real number (ℝ)

Distinct998
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2799.0541
Minimum40
Maximum4432
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB

Quantile statistics

Minimum40
5-th percentile689.85
Q12081.25
median2770.5
Q33723
95-th percentile4283.15
Maximum4432
Range4392
Interquartile range (IQR)1641.75

Descriptive statistics

Standard deviation1029.8935
Coefficient of variation (CV)0.36794342
Kurtosis-0.25397341
Mean2799.0541
Median Absolute Deviation (MAD)780
Skewness-0.3167215
Sum2793456
Variance1060680.7
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
40 1
 
0.1%
3039 1
 
0.1%
3026 1
 
0.1%
3027 1
 
0.1%
3028 1
 
0.1%
3029 1
 
0.1%
3030 1
 
0.1%
3031 1
 
0.1%
3032 1
 
0.1%
3033 1
 
0.1%
Other values (988) 988
99.0%
ValueCountFrequency (%)
40 1
0.1%
43 1
0.1%
44 1
0.1%
45 1
0.1%
46 1
0.1%
47 1
0.1%
51 1
0.1%
268 1
0.1%
269 1
0.1%
270 1
0.1%
ValueCountFrequency (%)
4432 1
0.1%
4382 1
0.1%
4381 1
0.1%
4380 1
0.1%
4379 1
0.1%
4378 1
0.1%
4377 1
0.1%
4376 1
0.1%
4372 1
0.1%
4371 1
0.1%

ID_FINCA
Real number (ℝ)

Distinct992
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2783.521
Minimum42
Maximum4650
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB

Quantile statistics

Minimum42
5-th percentile683.85
Q11941
median2657.5
Q33959.25
95-th percentile4511.15
Maximum4650
Range4608
Interquartile range (IQR)2018.25

Descriptive statistics

Standard deviation1136.7191
Coefficient of variation (CV)0.40837454
Kurtosis-0.63398398
Mean2783.521
Median Absolute Deviation (MAD)791
Skewness-0.0031504205
Sum2777954
Variance1292130.4
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3714 2
 
0.2%
2224 2
 
0.2%
1979 2
 
0.2%
2820 2
 
0.2%
1991 2
 
0.2%
1716 2
 
0.2%
2904 1
 
0.1%
2905 1
 
0.1%
2906 1
 
0.1%
2913 1
 
0.1%
Other values (982) 982
98.4%
ValueCountFrequency (%)
42 1
0.1%
43 1
0.1%
44 1
0.1%
45 1
0.1%
46 1
0.1%
47 1
0.1%
50 1
0.1%
240 1
0.1%
241 1
0.1%
242 1
0.1%
ValueCountFrequency (%)
4650 1
0.1%
4618 1
0.1%
4617 1
0.1%
4616 1
0.1%
4615 1
0.1%
4614 1
0.1%
4613 1
0.1%
4612 1
0.1%
4609 1
0.1%
4608 1
0.1%

ID_PROD
Real number (ℝ)

Distinct614
Distinct (%)61.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2625.7936
Minimum13
Maximum4835
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB

Quantile statistics

Minimum13
5-th percentile823.1
Q12035
median2461.5
Q32897
95-th percentile4757
Maximum4835
Range4822
Interquartile range (IQR)862

Descriptive statistics

Standard deviation1056.9347
Coefficient of variation (CV)0.40252011
Kurtosis0.47266281
Mean2625.7936
Median Absolute Deviation (MAD)430.5
Skewness0.51466799
Sum2620542
Variance1117111
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4733 12
 
1.2%
4757 8
 
0.8%
2848 7
 
0.7%
2843 7
 
0.7%
1843 7
 
0.7%
2905 6
 
0.6%
2696 6
 
0.6%
1901 6
 
0.6%
2375 6
 
0.6%
2869 6
 
0.6%
Other values (604) 927
92.9%
ValueCountFrequency (%)
13 1
0.1%
14 1
0.1%
15 1
0.1%
16 1
0.1%
17 1
0.1%
18 1
0.1%
22 1
0.1%
266 1
0.1%
267 1
0.1%
268 1
0.1%
ValueCountFrequency (%)
4835 4
0.4%
4834 2
0.2%
4833 1
 
0.1%
4831 1
 
0.1%
4829 1
 
0.1%
4828 1
 
0.1%
4826 3
0.3%
4825 1
 
0.1%
4824 1
 
0.1%
4791 2
0.2%

LAT_LOTE
Real number (ℝ)

Distinct964
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.9695112
Minimum8.093797
Maximum9.7194694
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB

Quantile statistics

Minimum8.093797
5-th percentile8.753319
Q18.8915194
median9.0039733
Q39.0808727
95-th percentile9.1372692
Maximum9.7194694
Range1.6256724
Interquartile range (IQR)0.18935322

Descriptive statistics

Standard deviation0.15742138
Coefficient of variation (CV)0.01755072
Kurtosis7.13899
Mean8.9695112
Median Absolute Deviation (MAD)0.088194444
Skewness-1.6697931
Sum8951.5722
Variance0.024781492
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8.943611111 3
 
0.3%
9.095222222 3
 
0.3%
8.877222222 2
 
0.2%
8.99237 2
 
0.2%
9.076666667 2
 
0.2%
8.901666667 2
 
0.2%
8.906111111 2
 
0.2%
8.940555556 2
 
0.2%
9.103 2
 
0.2%
9.014047 2
 
0.2%
Other values (954) 976
97.8%
ValueCountFrequency (%)
8.093797 1
0.1%
8.096087 1
0.1%
8.215555556 1
0.1%
8.24431 1
0.1%
8.245634 1
0.1%
8.252012 1
0.1%
8.257094 1
0.1%
8.265328 1
0.1%
8.266452452 1
0.1%
8.268726 1
0.1%
ValueCountFrequency (%)
9.719469444 1
0.1%
9.624166667 1
0.1%
9.464722222 1
0.1%
9.302305556 1
0.1%
9.300333333 1
0.1%
9.300194444 1
0.1%
9.297694444 1
0.1%
9.296055556 1
0.1%
9.295694444 1
0.1%
9.284944444 1
0.1%

LONG_LOTE
Real number (ℝ)

Distinct946
Distinct (%)94.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-75.760976
Minimum-76.573611
Maximum-75.413889
Zeros0
Zeros (%)0.0%
Negative998
Negative (%)100.0%
Memory size7.9 KiB

Quantile statistics

Minimum-76.573611
5-th percentile-75.839865
Q1-75.800743
median-75.766667
Q3-75.721301
95-th percentile-75.633362
Maximum-75.413889
Range1.1597222
Interquartile range (IQR)0.07944236

Descriptive statistics

Standard deviation0.1004434
Coefficient of variation (CV)-0.0013257934
Kurtosis14.68839
Mean-75.760976
Median Absolute Deviation (MAD)0.038639945
Skewness-1.3878058
Sum-75609.454
Variance0.010088878
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-75.76666667 4
 
0.4%
-75.76111111 3
 
0.3%
-75.79197222 2
 
0.2%
-75.73041667 2
 
0.2%
-75.715054 2
 
0.2%
-75.81197222 2
 
0.2%
-75.718249 2
 
0.2%
-75.76588889 2
 
0.2%
-75.716401 2
 
0.2%
-75.68686111 2
 
0.2%
Other values (936) 975
97.7%
ValueCountFrequency (%)
-76.57361111 1
0.1%
-76.53222222 1
0.1%
-76.46222222 1
0.1%
-76.28138889 1
0.1%
-76.172647 1
0.1%
-76.163395 1
0.1%
-76.16081947 1
0.1%
-76.160776 1
0.1%
-76.159909 1
0.1%
-76.15985324 1
0.1%
ValueCountFrequency (%)
-75.41388889 1
0.1%
-75.41444444 1
0.1%
-75.42861111 1
0.1%
-75.43416667 1
0.1%
-75.437004 1
0.1%
-75.43722222 1
0.1%
-75.438068 1
0.1%
-75.45166667 1
0.1%
-75.45416667 1
0.1%
-75.45833333 1
0.1%

FECHA_SIEMBRA
Categorical

Distinct204
Distinct (%)20.4%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
5/20/2015
 
63
5/25/2015
 
31
5/18/2015
 
29
5/21/2015
 
25
5/10/2016
 
24
Other values (199)
826 

Length

Max length10
Median length9
Mean length8.8206413
Min length8

Characters and Unicode

Total characters8803
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique81 ?
Unique (%)8.1%

Sample

1st row5/13/2013
2nd row5/2/2013
3rd row5/12/2013
4th row5/7/2013
5th row5/7/2013

Common Values

ValueCountFrequency (%)
5/20/2015 63
 
6.3%
5/25/2015 31
 
3.1%
5/18/2015 29
 
2.9%
5/21/2015 25
 
2.5%
5/10/2016 24
 
2.4%
5/12/2016 22
 
2.2%
5/9/2016 20
 
2.0%
5/7/2016 18
 
1.8%
5/15/2015 17
 
1.7%
5/4/2016 17
 
1.7%
Other values (194) 732
73.3%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
5/20/2015 63
 
6.3%
5/25/2015 31
 
3.1%
5/18/2015 29
 
2.9%
5/21/2015 25
 
2.5%
5/10/2016 24
 
2.4%
5/12/2016 22
 
2.2%
5/9/2016 20
 
2.0%
5/7/2016 18
 
1.8%
5/15/2015 17
 
1.7%
5/4/2016 17
 
1.7%
Other values (194) 732
73.3%

Most occurring characters

ValueCountFrequency (%)
/ 1996
22.7%
1 1520
17.3%
2 1469
16.7%
5 1450
16.5%
0 1271
14.4%
6 424
 
4.8%
9 185
 
2.1%
4 160
 
1.8%
3 136
 
1.5%
8 101
 
1.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6807
77.3%
Other Punctuation 1996
 
22.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 1520
22.3%
2 1469
21.6%
5 1450
21.3%
0 1271
18.7%
6 424
 
6.2%
9 185
 
2.7%
4 160
 
2.4%
3 136
 
2.0%
8 101
 
1.5%
7 91
 
1.3%
Other Punctuation
ValueCountFrequency (%)
/ 1996
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8803
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
/ 1996
22.7%
1 1520
17.3%
2 1469
16.7%
5 1450
16.5%
0 1271
14.4%
6 424
 
4.8%
9 185
 
2.1%
4 160
 
1.8%
3 136
 
1.5%
8 101
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8803
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 1996
22.7%
1 1520
17.3%
2 1469
16.7%
5 1450
16.5%
0 1271
14.4%
6 424
 
4.8%
9 185
 
2.1%
4 160
 
1.8%
3 136
 
1.5%
8 101
 
1.1%

TIPO_SIEMBRA
Categorical

Distinct2
Distinct (%)0.2%
Missing1
Missing (%)0.1%
Memory size7.9 KiB
Mecanizado
675 
Manual
322 

Length

Max length10
Median length10
Mean length8.7081244
Min length6

Characters and Unicode

Total characters8682
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMecanizado
2nd rowMecanizado
3rd rowMecanizado
4th rowMecanizado
5th rowMecanizado

Common Values

ValueCountFrequency (%)
Mecanizado 675
67.6%
Manual 322
32.3%
(Missing) 1
 
0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
mecanizado 675
67.7%
manual 322
32.3%

Most occurring characters

ValueCountFrequency (%)
a 1994
23.0%
M 997
11.5%
n 997
11.5%
e 675
 
7.8%
c 675
 
7.8%
i 675
 
7.8%
z 675
 
7.8%
d 675
 
7.8%
o 675
 
7.8%
u 322
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 7685
88.5%
Uppercase Letter 997
 
11.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 1994
25.9%
n 997
13.0%
e 675
 
8.8%
c 675
 
8.8%
i 675
 
8.8%
z 675
 
8.8%
d 675
 
8.8%
o 675
 
8.8%
u 322
 
4.2%
l 322
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
M 997
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 8682
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 1994
23.0%
M 997
11.5%
n 997
11.5%
e 675
 
7.8%
c 675
 
7.8%
i 675
 
7.8%
z 675
 
7.8%
d 675
 
7.8%
o 675
 
7.8%
u 322
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8682
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 1994
23.0%
M 997
11.5%
n 997
11.5%
e 675
 
7.8%
c 675
 
7.8%
i 675
 
7.8%
z 675
 
7.8%
d 675
 
7.8%
o 675
 
7.8%
u 322
 
3.7%

NUM_SEMILLAS
Real number (ℝ)

Distinct55
Distinct (%)5.5%
Missing1
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean5753.2376
Minimum0.8
Maximum80000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB

Quantile statistics

Minimum0.8
5-th percentile15.4
Q118
median20
Q321
95-th percentile65000
Maximum80000
Range79999.2
Interquartile range (IQR)3

Descriptive statistics

Standard deviation18554.219
Coefficient of variation (CV)3.2250047
Kurtosis6.9800343
Mean5753.2376
Median Absolute Deviation (MAD)2
Skewness2.9729163
Sum5735977.9
Variance3.4425903 × 108
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20 231
23.1%
18 142
14.2%
16 97
9.7%
21 74
 
7.4%
19 69
 
6.9%
22 52
 
5.2%
17 52
 
5.2%
15 37
 
3.7%
60000 27
 
2.7%
17.5 22
 
2.2%
Other values (45) 194
19.4%
ValueCountFrequency (%)
0.8 1
 
0.1%
1 1
 
0.1%
7 1
 
0.1%
12 3
 
0.3%
13 3
 
0.3%
14 4
 
0.4%
15 37
 
3.7%
15.5 1
 
0.1%
16 97
9.7%
16.5 5
 
0.5%
ValueCountFrequency (%)
80000 3
 
0.3%
75000 6
0.6%
74000 1
 
0.1%
73000 3
 
0.3%
72000 3
 
0.3%
71000 1
 
0.1%
70000 4
0.4%
68400 1
 
0.1%
68000 4
0.4%
67000 9
0.9%

SEM_TRATADAS
Categorical

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
NO
573 
SI
425 

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters1996
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNO
2nd rowSI
3rd rowNO
4th rowNO
5th rowNO

Common Values

ValueCountFrequency (%)
NO 573
57.4%
SI 425
42.6%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
no 573
57.4%
si 425
42.6%

Most occurring characters

ValueCountFrequency (%)
N 573
28.7%
O 573
28.7%
S 425
21.3%
I 425
21.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 1996
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
N 573
28.7%
O 573
28.7%
S 425
21.3%
I 425
21.3%

Most occurring scripts

ValueCountFrequency (%)
Latin 1996
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
N 573
28.7%
O 573
28.7%
S 425
21.3%
I 425
21.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1996
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
N 573
28.7%
O 573
28.7%
S 425
21.3%
I 425
21.3%

DIST_SURCOS
Real number (ℝ)

Distinct9
Distinct (%)0.9%
Missing2
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean0.79879518
Minimum0.2
Maximum1.5
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB

Quantile statistics

Minimum0.2
5-th percentile0.8
Q10.8
median0.8
Q30.8
95-th percentile0.8
Maximum1.5
Range1.3
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.051054747
Coefficient of variation (CV)0.06391469
Kurtosis95.641997
Mean0.79879518
Median Absolute Deviation (MAD)0
Skewness-1.7054746
Sum795.6
Variance0.0026065872
MonotonicityNot monotonic
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
0.8 929
93.1%
0.7 25
 
2.5%
0.75 16
 
1.6%
1 15
 
1.5%
0.9 4
 
0.4%
0.2 3
 
0.3%
0.85 2
 
0.2%
1.5 1
 
0.1%
0.5 1
 
0.1%
(Missing) 2
 
0.2%
ValueCountFrequency (%)
0.2 3
 
0.3%
0.5 1
 
0.1%
0.7 25
 
2.5%
0.75 16
 
1.6%
0.8 929
93.1%
0.85 2
 
0.2%
0.9 4
 
0.4%
1 15
 
1.5%
1.5 1
 
0.1%
ValueCountFrequency (%)
1.5 1
 
0.1%
1 15
 
1.5%
0.9 4
 
0.4%
0.85 2
 
0.2%
0.8 929
93.1%
0.75 16
 
1.6%
0.7 25
 
2.5%
0.5 1
 
0.1%
0.2 3
 
0.3%

DIST_PLANTAS
Real number (ℝ)

Distinct25
Distinct (%)2.5%
Missing2
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean0.24044779
Minimum0.1
Maximum1.5
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB

Quantile statistics

Minimum0.1
5-th percentile0.14
Q10.15
median0.2
Q30.2
95-th percentile0.5
Maximum1.5
Range1.4
Interquartile range (IQR)0.05

Descriptive statistics

Standard deviation0.1451745
Coefficient of variation (CV)0.60376723
Kurtosis10.200469
Mean0.24044779
Median Absolute Deviation (MAD)0.04
Skewness2.6082481
Sum239.486
Variance0.021075634
MonotonicityNot monotonic
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
0.2 269
27.0%
0.18 171
17.1%
0.15 121
12.1%
0.14 107
 
10.7%
0.5 103
 
10.3%
0.4 56
 
5.6%
0.17 33
 
3.3%
0.3 32
 
3.2%
0.16 24
 
2.4%
0.13 22
 
2.2%
Other values (15) 58
 
5.8%
ValueCountFrequency (%)
0.1 1
 
0.1%
0.12 5
 
0.5%
0.125 10
 
1.0%
0.13 22
 
2.2%
0.14 107
10.7%
0.15 121
12.1%
0.16 24
 
2.4%
0.165 1
 
0.1%
0.17 33
 
3.3%
0.18 171
17.1%
ValueCountFrequency (%)
1.5 1
 
0.1%
1 6
 
0.6%
0.8 2
 
0.2%
0.75 1
 
0.1%
0.7 10
 
1.0%
0.6 1
 
0.1%
0.55 3
 
0.3%
0.5 103
10.3%
0.4 56
5.6%
0.35 6
 
0.6%

TIPO_CULTIVO
Categorical

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
Maiz
998 

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters3992
Distinct characters4
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMaiz
2nd rowMaiz
3rd rowMaiz
4th rowMaiz
5th rowMaiz

Common Values

ValueCountFrequency (%)
Maiz 998
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
maiz 998
100.0%

Most occurring characters

ValueCountFrequency (%)
M 998
25.0%
a 998
25.0%
i 998
25.0%
z 998
25.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2994
75.0%
Uppercase Letter 998
 
25.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 998
33.3%
i 998
33.3%
z 998
33.3%
Uppercase Letter
ValueCountFrequency (%)
M 998
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 3992
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
M 998
25.0%
a 998
25.0%
i 998
25.0%
z 998
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3992
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
M 998
25.0%
a 998
25.0%
i 998
25.0%
z 998
25.0%

COLOR_ENDOSPERMO
Categorical

Distinct2
Distinct (%)0.2%
Missing1
Missing (%)0.1%
Memory size7.9 KiB
Blanco
553 
Amarillo
444 

Length

Max length8
Median length6
Mean length6.890672
Min length6

Characters and Unicode

Total characters6870
Distinct characters10
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowBlanco
2nd rowBlanco
3rd rowBlanco
4th rowBlanco
5th rowBlanco

Common Values

ValueCountFrequency (%)
Blanco 553
55.4%
Amarillo 444
44.5%
(Missing) 1
 
0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
blanco 553
55.5%
amarillo 444
44.5%

Most occurring characters

ValueCountFrequency (%)
l 1441
21.0%
a 997
14.5%
o 997
14.5%
B 553
 
8.0%
n 553
 
8.0%
c 553
 
8.0%
A 444
 
6.5%
m 444
 
6.5%
r 444
 
6.5%
i 444
 
6.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 5873
85.5%
Uppercase Letter 997
 
14.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
l 1441
24.5%
a 997
17.0%
o 997
17.0%
n 553
 
9.4%
c 553
 
9.4%
m 444
 
7.6%
r 444
 
7.6%
i 444
 
7.6%
Uppercase Letter
ValueCountFrequency (%)
B 553
55.5%
A 444
44.5%

Most occurring scripts

ValueCountFrequency (%)
Latin 6870
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
l 1441
21.0%
a 997
14.5%
o 997
14.5%
B 553
 
8.0%
n 553
 
8.0%
c 553
 
8.0%
A 444
 
6.5%
m 444
 
6.5%
r 444
 
6.5%
i 444
 
6.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6870
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
l 1441
21.0%
a 997
14.5%
o 997
14.5%
B 553
 
8.0%
n 553
 
8.0%
c 553
 
8.0%
A 444
 
6.5%
m 444
 
6.5%
r 444
 
6.5%
i 444
 
6.5%

SEM_POR_SITIO
Categorical

Distinct5
Distinct (%)0.5%
Missing3
Missing (%)0.3%
Memory size7.9 KiB
1.0
717 
3.0
188 
2.0
78 
4.0
 
11
5.0
 
1

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters2985
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row2.0
2nd row1.0
3rd row1.0
4th row1.0
5th row1.0

Common Values

ValueCountFrequency (%)
1.0 717
71.8%
3.0 188
 
18.8%
2.0 78
 
7.8%
4.0 11
 
1.1%
5.0 1
 
0.1%
(Missing) 3
 
0.3%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
1.0 717
72.1%
3.0 188
 
18.9%
2.0 78
 
7.8%
4.0 11
 
1.1%
5.0 1
 
0.1%

Most occurring characters

ValueCountFrequency (%)
. 995
33.3%
0 995
33.3%
1 717
24.0%
3 188
 
6.3%
2 78
 
2.6%
4 11
 
0.4%
5 1
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1990
66.7%
Other Punctuation 995
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 995
50.0%
1 717
36.0%
3 188
 
9.4%
2 78
 
3.9%
4 11
 
0.6%
5 1
 
0.1%
Other Punctuation
ValueCountFrequency (%)
. 995
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2985
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
. 995
33.3%
0 995
33.3%
1 717
24.0%
3 188
 
6.3%
2 78
 
2.6%
4 11
 
0.4%
5 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2985
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 995
33.3%
0 995
33.3%
1 717
24.0%
3 188
 
6.3%
2 78
 
2.6%
4 11
 
0.4%
5 1
 
< 0.1%

TIPO_DE_SEMILLA
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing998
Missing (%)100.0%
Memory size7.9 KiB

HABITO_CRECIMIENTO
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing998
Missing (%)100.0%
Memory size7.9 KiB
Distinct28
Distinct (%)2.8%
Missing1
Missing (%)0.1%
Memory size7.9 KiB
Otro
284 
P3966 (Pioneer)
115 
DK 234
112 
DK 234 YGRR
110 
PIONEER 30F35 HRR
60 
Other values (23)
316 

Length

Max length19
Median length17
Mean length9.8094283
Min length4

Characters and Unicode

Total characters9780
Distinct characters47
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)0.5%

Sample

1st rowPIONEER 30F32
2nd rowDK 234
3rd rowPIONEER 30F32
4th rowOtro
5th rowOtro

Common Values

ValueCountFrequency (%)
Otro 284
28.5%
P3966 (Pioneer) 115
11.5%
DK 234 112
 
11.2%
DK 234 YGRR 110
 
11.0%
PIONEER 30F35 HRR 60
 
6.0%
PAC 105 59
 
5.9%
P4082 (Pioneer) 56
 
5.6%
DK7088 39
 
3.9%
ADV 9339 (Syngenta) 31
 
3.1%
Impacto (Syngenta) 29
 
2.9%
Other values (18) 102
 
10.2%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
otro 284
14.9%
pioneer 280
14.7%
dk 226
11.8%
234 222
11.6%
p3966 115
 
6.0%
ygrr 110
 
5.8%
30f35 103
 
5.4%
syngenta 95
 
5.0%
hrr 60
 
3.1%
pac 59
 
3.1%
Other values (26) 354
18.6%

Most occurring characters

ValueCountFrequency (%)
911
 
9.3%
3 630
 
6.4%
P 511
 
5.2%
o 506
 
5.2%
r 457
 
4.7%
R 450
 
4.6%
e 439
 
4.5%
t 412
 
4.2%
O 395
 
4.0%
n 378
 
3.9%
Other values (37) 4691
48.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 3174
32.5%
Lowercase Letter 2815
28.8%
Decimal Number 2348
24.0%
Space Separator 911
 
9.3%
Close Punctuation 266
 
2.7%
Open Punctuation 266
 
2.7%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
P 511
16.1%
R 450
14.2%
O 395
12.4%
D 306
9.6%
K 267
8.4%
E 218
6.9%
I 149
 
4.7%
N 114
 
3.6%
S 113
 
3.6%
F 112
 
3.5%
Other values (8) 539
17.0%
Lowercase Letter
ValueCountFrequency (%)
o 506
18.0%
r 457
16.2%
e 439
15.6%
t 412
14.6%
n 378
13.4%
i 191
 
6.8%
a 130
 
4.6%
y 95
 
3.4%
g 95
 
3.4%
m 32
 
1.1%
Other values (6) 80
 
2.8%
Decimal Number
ValueCountFrequency (%)
3 630
26.8%
2 296
12.6%
4 284
12.1%
0 274
11.7%
6 236
 
10.1%
9 206
 
8.8%
5 172
 
7.3%
8 134
 
5.7%
1 77
 
3.3%
7 39
 
1.7%
Space Separator
ValueCountFrequency (%)
911
100.0%
Close Punctuation
ValueCountFrequency (%)
) 266
100.0%
Open Punctuation
ValueCountFrequency (%)
( 266
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 5989
61.2%
Common 3791
38.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
P 511
 
8.5%
o 506
 
8.4%
r 457
 
7.6%
R 450
 
7.5%
e 439
 
7.3%
t 412
 
6.9%
O 395
 
6.6%
n 378
 
6.3%
D 306
 
5.1%
K 267
 
4.5%
Other values (24) 1868
31.2%
Common
ValueCountFrequency (%)
911
24.0%
3 630
16.6%
2 296
 
7.8%
4 284
 
7.5%
0 274
 
7.2%
) 266
 
7.0%
( 266
 
7.0%
6 236
 
6.2%
9 206
 
5.4%
5 172
 
4.5%
Other values (3) 250
 
6.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9780
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
911
 
9.3%
3 630
 
6.4%
P 511
 
5.2%
o 506
 
5.2%
r 457
 
4.7%
R 450
 
4.6%
e 439
 
4.5%
t 412
 
4.2%
O 395
 
4.0%
n 378
 
3.9%
Other values (37) 4691
48.0%

OBJ_RDT
Real number (ℝ)

MISSING  ZEROS 

Distinct71
Distinct (%)8.6%
Missing176
Missing (%)17.6%
Infinite0
Infinite (%)0.0%
Mean6327.1588
Minimum0
Maximum30000
Zeros10
Zeros (%)1.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB

Quantile statistics

Minimum0
5-th percentile2525
Q15500
median6500
Q37200
95-th percentile8000
Maximum30000
Range30000
Interquartile range (IQR)1700

Descriptive statistics

Standard deviation2840.2773
Coefficient of variation (CV)0.44890248
Kurtosis38.31064
Mean6327.1588
Median Absolute Deviation (MAD)700
Skewness4.3229626
Sum5200924.5
Variance8067175
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
6000 134
13.4%
7000 127
12.7%
7500 79
7.9%
6500 74
 
7.4%
5000 68
 
6.8%
7800 37
 
3.7%
8000 32
 
3.2%
5500 27
 
2.7%
4500 26
 
2.6%
4000 21
 
2.1%
Other values (61) 197
19.7%
(Missing) 176
17.6%
ValueCountFrequency (%)
0 10
1.0%
3 1
 
0.1%
3.5 2
 
0.2%
4 2
 
0.2%
4.5 1
 
0.1%
5 20
2.0%
5.5 1
 
0.1%
6 2
 
0.2%
7 1
 
0.1%
7.5 1
 
0.1%
ValueCountFrequency (%)
30000 5
 
0.5%
29000 2
 
0.2%
18000 1
 
0.1%
15000 1
 
0.1%
9000 1
 
0.1%
8500 6
 
0.6%
8200 1
 
0.1%
8000 32
3.2%
7950 2
 
0.2%
7900 2
 
0.2%

CULT_ANT
Categorical

Distinct6
Distinct (%)0.6%
Missing5
Missing (%)0.5%
Memory size7.9 KiB
Maiz
486 
Algodón
466 
Pastos
 
22
Frijol
 
17
Yuca
 
1

Length

Max length7
Median length6
Mean length5.4874119
Min length4

Characters and Unicode

Total characters5449
Distinct characters20
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)0.2%

Sample

1st rowAlgodón
2nd rowMaiz
3rd rowAlgodón
4th rowAlgodón
5th rowAlgodón

Common Values

ValueCountFrequency (%)
Maiz 486
48.7%
Algodón 466
46.7%
Pastos 22
 
2.2%
Frijol 17
 
1.7%
Yuca 1
 
0.1%
Arroz 1
 
0.1%
(Missing) 5
 
0.5%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
maiz 486
48.9%
algodón 466
46.9%
pastos 22
 
2.2%
frijol 17
 
1.7%
yuca 1
 
0.1%
arroz 1
 
0.1%

Most occurring characters

ValueCountFrequency (%)
a 509
9.3%
o 506
9.3%
i 503
9.2%
z 487
8.9%
M 486
8.9%
l 483
8.9%
A 467
8.6%
ó 466
8.6%
n 466
8.6%
d 466
8.6%
Other values (10) 610
11.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 4456
81.8%
Uppercase Letter 993
 
18.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 509
11.4%
o 506
11.4%
i 503
11.3%
z 487
10.9%
l 483
10.8%
ó 466
10.5%
n 466
10.5%
d 466
10.5%
g 466
10.5%
s 44
 
1.0%
Other values (5) 60
 
1.3%
Uppercase Letter
ValueCountFrequency (%)
M 486
48.9%
A 467
47.0%
P 22
 
2.2%
F 17
 
1.7%
Y 1
 
0.1%

Most occurring scripts

ValueCountFrequency (%)
Latin 5449
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 509
9.3%
o 506
9.3%
i 503
9.2%
z 487
8.9%
M 486
8.9%
l 483
8.9%
A 467
8.6%
ó 466
8.6%
n 466
8.6%
d 466
8.6%
Other values (10) 610
11.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4983
91.4%
None 466
 
8.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 509
10.2%
o 506
10.2%
i 503
10.1%
z 487
9.8%
M 486
9.8%
l 483
9.7%
A 467
9.4%
n 466
9.4%
d 466
9.4%
g 466
9.4%
Other values (9) 144
 
2.9%
None
ValueCountFrequency (%)
ó 466
100.0%

DRENAJE
Categorical

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
SI
595 
NO
403 

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters1996
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSI
2nd rowSI
3rd rowSI
4th rowSI
5th rowSI

Common Values

ValueCountFrequency (%)
SI 595
59.6%
NO 403
40.4%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
si 595
59.6%
no 403
40.4%

Most occurring characters

ValueCountFrequency (%)
S 595
29.8%
I 595
29.8%
N 403
20.2%
O 403
20.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 1996
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
S 595
29.8%
I 595
29.8%
N 403
20.2%
O 403
20.2%

Most occurring scripts

ValueCountFrequency (%)
Latin 1996
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
S 595
29.8%
I 595
29.8%
N 403
20.2%
O 403
20.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1996
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
S 595
29.8%
I 595
29.8%
N 403
20.2%
O 403
20.2%

FECHA_EMERGENCIA
Categorical

Distinct205
Distinct (%)20.7%
Missing6
Missing (%)0.6%
Memory size7.9 KiB
5/25/2015
 
41
5/24/2015
 
35
5/30/2015
 
24
5/23/2015
 
22
5/28/2015
 
21
Other values (200)
849 

Length

Max length10
Median length9
Mean length8.9102823
Min length8

Characters and Unicode

Total characters8839
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique75 ?
Unique (%)7.6%

Sample

1st row5/18/2013
2nd row5/7/2013
3rd row5/17/2013
4th row5/12/2013
5th row5/12/2013

Common Values

ValueCountFrequency (%)
5/25/2015 41
 
4.1%
5/24/2015 35
 
3.5%
5/30/2015 24
 
2.4%
5/23/2015 22
 
2.2%
5/28/2015 21
 
2.1%
5/26/2015 21
 
2.1%
5/20/2015 20
 
2.0%
5/22/2015 20
 
2.0%
5/14/2016 20
 
2.0%
5/15/2016 20
 
2.0%
Other values (195) 748
74.9%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
5/25/2015 41
 
4.1%
5/24/2015 35
 
3.5%
5/30/2015 24
 
2.4%
5/23/2015 22
 
2.2%
5/28/2015 21
 
2.1%
5/26/2015 21
 
2.1%
5/20/2015 20
 
2.0%
5/22/2015 20
 
2.0%
5/14/2016 20
 
2.0%
5/15/2016 20
 
2.0%
Other values (195) 748
75.4%

Most occurring characters

ValueCountFrequency (%)
/ 1984
22.4%
1 1539
17.4%
2 1489
16.8%
5 1396
15.8%
0 1265
14.3%
6 479
 
5.4%
4 179
 
2.0%
3 172
 
1.9%
9 165
 
1.9%
7 86
 
1.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6855
77.6%
Other Punctuation 1984
 
22.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 1539
22.5%
2 1489
21.7%
5 1396
20.4%
0 1265
18.5%
6 479
 
7.0%
4 179
 
2.6%
3 172
 
2.5%
9 165
 
2.4%
7 86
 
1.3%
8 85
 
1.2%
Other Punctuation
ValueCountFrequency (%)
/ 1984
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8839
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
/ 1984
22.4%
1 1539
17.4%
2 1489
16.8%
5 1396
15.8%
0 1265
14.3%
6 479
 
5.4%
4 179
 
2.0%
3 172
 
1.9%
9 165
 
1.9%
7 86
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8839
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 1984
22.4%
1 1539
17.4%
2 1489
16.8%
5 1396
15.8%
0 1265
14.3%
6 479
 
5.4%
4 179
 
2.0%
3 172
 
1.9%
9 165
 
1.9%
7 86
 
1.0%

POBLACION_20DIAS
Real number (ℝ)

Distinct79
Distinct (%)8.0%
Missing7
Missing (%)0.7%
Infinite0
Infinite (%)0.0%
Mean64364.693
Minimum6000
Maximum180000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB

Quantile statistics

Minimum6000
5-th percentile50000
Q160000
median62500
Q371000
95-th percentile75000
Maximum180000
Range174000
Interquartile range (IQR)11000

Descriptive statistics

Standard deviation10225.157
Coefficient of variation (CV)0.15886283
Kurtosis21.339861
Mean64364.693
Median Absolute Deviation (MAD)5100
Skewness1.2649307
Sum63785411
Variance1.0455384 × 108
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
60000 232
23.2%
62000 72
 
7.2%
55000 70
 
7.0%
65000 69
 
6.9%
70000 69
 
6.9%
73000 61
 
6.1%
72000 60
 
6.0%
75000 48
 
4.8%
62500 37
 
3.7%
71000 33
 
3.3%
Other values (69) 240
24.0%
ValueCountFrequency (%)
6000 1
 
0.1%
13500 1
 
0.1%
15000 1
 
0.1%
25000 2
0.2%
30000 3
0.3%
32000 2
0.2%
33000 1
 
0.1%
37500 1
 
0.1%
41000 3
0.3%
41500 1
 
0.1%
ValueCountFrequency (%)
180000 1
 
0.1%
120000 3
0.3%
100000 2
0.2%
93331 2
0.2%
90000 4
0.4%
87500 2
0.2%
85700 1
 
0.1%
85000 1
 
0.1%
84000 4
0.4%
80500 1
 
0.1%

FECHA_FLORACION
Categorical

Distinct212
Distinct (%)21.4%
Missing6
Missing (%)0.6%
Memory size7.9 KiB
7/10/2015
 
33
7/16/2015
 
26
7/13/2015
 
22
7/11/2015
 
22
7/6/2015
 
21
Other values (207)
868 

Length

Max length10
Median length9
Mean length8.9586694
Min length8

Characters and Unicode

Total characters8887
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)8.4%

Sample

1st row7/20/2013
2nd row7/10/2013
3rd row7/15/2013
4th row7/15/2013
5th row7/14/2013

Common Values

ValueCountFrequency (%)
7/10/2015 33
 
3.3%
7/16/2015 26
 
2.6%
7/13/2015 22
 
2.2%
7/11/2015 22
 
2.2%
7/6/2015 21
 
2.1%
7/15/2015 21
 
2.1%
7/12/2015 21
 
2.1%
7/9/2015 20
 
2.0%
6/30/2016 20
 
2.0%
7/14/2015 18
 
1.8%
Other values (202) 768
77.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
7/10/2015 33
 
3.3%
7/16/2015 26
 
2.6%
7/13/2015 22
 
2.2%
7/11/2015 22
 
2.2%
7/6/2015 21
 
2.1%
7/15/2015 21
 
2.1%
7/12/2015 21
 
2.1%
7/9/2015 20
 
2.0%
6/30/2016 20
 
2.0%
7/14/2015 18
 
1.8%
Other values (202) 768
77.4%

Most occurring characters

ValueCountFrequency (%)
/ 1984
22.3%
1 1862
21.0%
2 1418
16.0%
0 1132
12.7%
5 746
 
8.4%
6 630
 
7.1%
7 617
 
6.9%
3 185
 
2.1%
4 137
 
1.5%
8 103
 
1.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6903
77.7%
Other Punctuation 1984
 
22.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 1862
27.0%
2 1418
20.5%
0 1132
16.4%
5 746
10.8%
6 630
 
9.1%
7 617
 
8.9%
3 185
 
2.7%
4 137
 
2.0%
8 103
 
1.5%
9 73
 
1.1%
Other Punctuation
ValueCountFrequency (%)
/ 1984
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8887
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
/ 1984
22.3%
1 1862
21.0%
2 1418
16.0%
0 1132
12.7%
5 746
 
8.4%
6 630
 
7.1%
7 617
 
6.9%
3 185
 
2.1%
4 137
 
1.5%
8 103
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8887
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 1984
22.3%
1 1862
21.0%
2 1418
16.0%
0 1132
12.7%
5 746
 
8.4%
6 630
 
7.1%
7 617
 
6.9%
3 185
 
2.1%
4 137
 
1.5%
8 103
 
1.2%

FECHA_COSECHA
Categorical

HIGH CARDINALITY  MISSING 

Distinct226
Distinct (%)22.9%
Missing12
Missing (%)1.2%
Memory size7.9 KiB
9/28/2015
 
48
9/20/2016
 
29
10/5/2015
 
23
9/25/2015
 
21
9/19/2016
 
20
Other values (221)
845 

Length

Max length10
Median length9
Mean length8.9563895
Min length8

Characters and Unicode

Total characters8831
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)9.5%

Sample

1st row9/26/2013
2nd row9/11/2013
3rd row9/19/2013
4th row9/12/2013
5th row9/12/2013

Common Values

ValueCountFrequency (%)
9/28/2015 48
 
4.8%
9/20/2016 29
 
2.9%
10/5/2015 23
 
2.3%
9/25/2015 21
 
2.1%
9/19/2016 20
 
2.0%
9/30/2015 20
 
2.0%
9/15/2015 16
 
1.6%
9/21/2015 16
 
1.6%
9/29/2015 15
 
1.5%
9/27/2015 15
 
1.5%
Other values (216) 763
76.5%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
9/28/2015 48
 
4.9%
9/20/2016 29
 
2.9%
10/5/2015 23
 
2.3%
9/25/2015 21
 
2.1%
9/19/2016 20
 
2.0%
9/30/2015 20
 
2.0%
9/15/2015 16
 
1.6%
9/21/2015 16
 
1.6%
9/29/2015 15
 
1.5%
9/27/2015 15
 
1.5%
Other values (216) 763
77.4%

Most occurring characters

ValueCountFrequency (%)
/ 1972
22.3%
2 1683
19.1%
1 1591
18.0%
0 1292
14.6%
9 645
 
7.3%
5 566
 
6.4%
6 549
 
6.2%
8 172
 
1.9%
3 149
 
1.7%
4 130
 
1.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6859
77.7%
Other Punctuation 1972
 
22.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 1683
24.5%
1 1591
23.2%
0 1292
18.8%
9 645
 
9.4%
5 566
 
8.3%
6 549
 
8.0%
8 172
 
2.5%
3 149
 
2.2%
4 130
 
1.9%
7 82
 
1.2%
Other Punctuation
ValueCountFrequency (%)
/ 1972
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8831
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
/ 1972
22.3%
2 1683
19.1%
1 1591
18.0%
0 1292
14.6%
9 645
 
7.3%
5 566
 
6.4%
6 549
 
6.2%
8 172
 
1.9%
3 149
 
1.7%
4 130
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8831
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 1972
22.3%
2 1683
19.1%
1 1591
18.0%
0 1292
14.6%
9 645
 
7.3%
5 566
 
6.4%
6 549
 
6.2%
8 172
 
1.9%
3 149
 
1.7%
4 130
 
1.5%

METODO_COSECHA
Categorical

Distinct2
Distinct (%)0.2%
Missing12
Missing (%)1.2%
Memory size7.9 KiB
Manual
676 
Mecanizada
310 

Length

Max length10
Median length6
Mean length7.2576065
Min length6

Characters and Unicode

Total characters7156
Distinct characters10
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowManual
2nd rowManual
3rd rowManual
4th rowManual
5th rowManual

Common Values

ValueCountFrequency (%)
Manual 676
67.7%
Mecanizada 310
31.1%
(Missing) 12
 
1.2%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
manual 676
68.6%
mecanizada 310
31.4%

Most occurring characters

ValueCountFrequency (%)
a 2282
31.9%
M 986
13.8%
n 986
13.8%
u 676
 
9.4%
l 676
 
9.4%
e 310
 
4.3%
c 310
 
4.3%
i 310
 
4.3%
z 310
 
4.3%
d 310
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 6170
86.2%
Uppercase Letter 986
 
13.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 2282
37.0%
n 986
16.0%
u 676
 
11.0%
l 676
 
11.0%
e 310
 
5.0%
c 310
 
5.0%
i 310
 
5.0%
z 310
 
5.0%
d 310
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
M 986
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 7156
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 2282
31.9%
M 986
13.8%
n 986
13.8%
u 676
 
9.4%
l 676
 
9.4%
e 310
 
4.3%
c 310
 
4.3%
i 310
 
4.3%
z 310
 
4.3%
d 310
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7156
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 2282
31.9%
M 986
13.8%
n 986
13.8%
u 676
 
9.4%
l 676
 
9.4%
e 310
 
4.3%
c 310
 
4.3%
i 310
 
4.3%
z 310
 
4.3%
d 310
 
4.3%

RDT
Real number (ℝ)

Distinct176
Distinct (%)17.8%
Missing12
Missing (%)1.2%
Infinite0
Infinite (%)0.0%
Mean6124.3884
Minimum1200
Maximum68000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB

Quantile statistics

Minimum1200
5-th percentile2200
Q14000
median5100
Q36200
95-th percentile15750
Maximum68000
Range66800
Interquartile range (IQR)2200

Descriptive statistics

Standard deviation5735.8886
Coefficient of variation (CV)0.93656511
Kurtosis33.039335
Mean6124.3884
Median Absolute Deviation (MAD)1100
Skewness5.0267859
Sum6038647
Variance32900417
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4000 49
 
4.9%
5000 48
 
4.8%
5500 41
 
4.1%
4500 38
 
3.8%
6000 31
 
3.1%
6500 29
 
2.9%
5200 26
 
2.6%
3500 26
 
2.6%
5800 23
 
2.3%
4800 23
 
2.3%
Other values (166) 652
65.3%
ValueCountFrequency (%)
1200 6
0.6%
1300 1
 
0.1%
1400 1
 
0.1%
1500 1
 
0.1%
1600 3
0.3%
1700 2
 
0.2%
1800 7
0.7%
1900 2
 
0.2%
1990 1
 
0.1%
2000 7
0.7%
ValueCountFrequency (%)
68000 1
 
0.1%
60580 1
 
0.1%
45600 1
 
0.1%
39800 1
 
0.1%
38000 1
 
0.1%
37000 3
0.3%
36000 2
0.2%
32000 2
0.2%
30000 4
0.4%
29000 2
0.2%

PROD_COSECHADO
Categorical

IMBALANCE  MISSING 

Distinct3
Distinct (%)0.3%
Missing12
Missing (%)1.2%
Memory size7.9 KiB
Grano seco
885 
Ensilaje
 
64
Mazorca (fresca)
 
37

Length

Max length16
Median length10
Mean length10.095335
Min length8

Characters and Unicode

Total characters9954
Distinct characters18
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowGrano seco
2nd rowGrano seco
3rd rowGrano seco
4th rowGrano seco
5th rowGrano seco

Common Values

ValueCountFrequency (%)
Grano seco 885
88.7%
Ensilaje 64
 
6.4%
Mazorca (fresca) 37
 
3.7%
(Missing) 12
 
1.2%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
grano 885
46.4%
seco 885
46.4%
ensilaje 64
 
3.4%
mazorca 37
 
1.9%
fresca 37
 
1.9%

Most occurring characters

ValueCountFrequency (%)
o 1807
18.2%
a 1060
10.6%
s 986
9.9%
e 986
9.9%
c 959
9.6%
r 959
9.6%
n 949
9.5%
922
9.3%
G 885
8.9%
l 64
 
0.6%
Other values (8) 377
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 7972
80.1%
Uppercase Letter 986
 
9.9%
Space Separator 922
 
9.3%
Open Punctuation 37
 
0.4%
Close Punctuation 37
 
0.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 1807
22.7%
a 1060
13.3%
s 986
12.4%
e 986
12.4%
c 959
12.0%
r 959
12.0%
n 949
11.9%
l 64
 
0.8%
j 64
 
0.8%
i 64
 
0.8%
Other values (2) 74
 
0.9%
Uppercase Letter
ValueCountFrequency (%)
G 885
89.8%
E 64
 
6.5%
M 37
 
3.8%
Space Separator
ValueCountFrequency (%)
922
100.0%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%
Close Punctuation
ValueCountFrequency (%)
) 37
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 8958
90.0%
Common 996
 
10.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 1807
20.2%
a 1060
11.8%
s 986
11.0%
e 986
11.0%
c 959
10.7%
r 959
10.7%
n 949
10.6%
G 885
9.9%
l 64
 
0.7%
j 64
 
0.7%
Other values (5) 239
 
2.7%
Common
ValueCountFrequency (%)
922
92.6%
( 37
 
3.7%
) 37
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9954
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 1807
18.2%
a 1060
10.6%
s 986
9.9%
e 986
9.9%
c 959
9.6%
r 959
9.6%
n 949
9.5%
922
9.3%
G 885
8.9%
l 64
 
0.6%
Other values (8) 377
 
3.8%

NOMBRE_LOTE
Categorical

Distinct727
Distinct (%)72.8%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
Chuchurubi
 
17
LA ESPERANZA
 
13
El Zapal
 
11
MAIZ
 
8
EL HIGO
 
8
Other values (722)
941 

Length

Max length31
Median length18
Mean length9.6683367
Min length3

Characters and Unicode

Total characters9649
Distinct characters64
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique596 ?
Unique (%)59.7%

Sample

1st rowVILLA GABRIELA
2nd rowVILLA LOURDES
3rd rowSANTA MARTA
4th rowPALMAR
5th rowDONDE FIDEL

Common Values

ValueCountFrequency (%)
Chuchurubi 17
 
1.7%
LA ESPERANZA 13
 
1.3%
El Zapal 11
 
1.1%
MAIZ 8
 
0.8%
EL HIGO 8
 
0.8%
Carolina 8
 
0.8%
EL ROBLE 8
 
0.8%
LOTE 1 8
 
0.8%
EL CANAL 7
 
0.7%
SANTA LUCIA 6
 
0.6%
Other values (717) 904
90.6%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
el 205
 
10.5%
la 178
 
9.2%
2 92
 
4.7%
1 88
 
4.5%
villa 41
 
2.1%
las 36
 
1.9%
san 36
 
1.9%
los 35
 
1.8%
santa 30
 
1.5%
esperanza 25
 
1.3%
Other values (523) 1178
60.6%

Most occurring characters

ValueCountFrequency (%)
970
 
10.1%
A 900
 
9.3%
L 658
 
6.8%
a 556
 
5.8%
E 498
 
5.2%
O 462
 
4.8%
I 387
 
4.0%
S 331
 
3.4%
l 329
 
3.4%
R 317
 
3.3%
Other values (54) 4241
44.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 5492
56.9%
Lowercase Letter 2956
30.6%
Space Separator 970
 
10.1%
Decimal Number 227
 
2.4%
Other Punctuation 3
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 556
18.8%
l 329
11.1%
o 282
9.5%
i 264
8.9%
e 235
7.9%
r 217
 
7.3%
n 183
 
6.2%
t 129
 
4.4%
c 120
 
4.1%
s 117
 
4.0%
Other values (18) 524
17.7%
Uppercase Letter
ValueCountFrequency (%)
A 900
16.4%
L 658
12.0%
E 498
9.1%
O 462
8.4%
I 387
 
7.0%
S 331
 
6.0%
R 317
 
5.8%
C 311
 
5.7%
N 274
 
5.0%
T 219
 
4.0%
Other values (15) 1135
20.7%
Decimal Number
ValueCountFrequency (%)
1 100
44.1%
2 97
42.7%
3 11
 
4.8%
4 7
 
3.1%
5 6
 
2.6%
0 4
 
1.8%
6 1
 
0.4%
9 1
 
0.4%
Space Separator
ValueCountFrequency (%)
970
100.0%
Other Punctuation
ValueCountFrequency (%)
. 3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 8448
87.6%
Common 1201
 
12.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 900
 
10.7%
L 658
 
7.8%
a 556
 
6.6%
E 498
 
5.9%
O 462
 
5.5%
I 387
 
4.6%
S 331
 
3.9%
l 329
 
3.9%
R 317
 
3.8%
C 311
 
3.7%
Other values (43) 3699
43.8%
Common
ValueCountFrequency (%)
970
80.8%
1 100
 
8.3%
2 97
 
8.1%
3 11
 
0.9%
4 7
 
0.6%
5 6
 
0.5%
0 4
 
0.3%
. 3
 
0.2%
- 1
 
0.1%
6 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9631
99.8%
None 18
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
970
 
10.1%
A 900
 
9.3%
L 658
 
6.8%
a 556
 
5.8%
E 498
 
5.2%
O 462
 
4.8%
I 387
 
4.0%
S 331
 
3.4%
l 329
 
3.4%
R 317
 
3.3%
Other values (50) 4223
43.8%
None
ValueCountFrequency (%)
Ñ 14
77.8%
ñ 2
 
11.1%
ó 1
 
5.6%
è 1
 
5.6%

ORIGEN_SEMILLA
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing998
Missing (%)100.0%
Memory size7.9 KiB

INOCULACION_SEMILLAS
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing998
Missing (%)100.0%
Memory size7.9 KiB

NUEVA_INOCULACION_SEMILLAS
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing998
Missing (%)100.0%
Memory size7.9 KiB

PRODUCTO_USADO
Categorical

Distinct5
Distinct (%)1.2%
Missing571
Missing (%)57.2%
Memory size7.9 KiB
Insecticidas
301 
Fungicida+Insecticida
95 
Fungicidas
 
17
Desconocido
 
11
Otro
 
3

Length

Max length21
Median length12
Mean length13.840749
Min length4

Characters and Unicode

Total characters5910
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowInsecticidas
2nd rowInsecticidas
3rd rowInsecticidas
4th rowInsecticidas
5th rowInsecticidas

Common Values

ValueCountFrequency (%)
Insecticidas 301
30.2%
Fungicida+Insecticida 95
 
9.5%
Fungicidas 17
 
1.7%
Desconocido 11
 
1.1%
Otro 3
 
0.3%
(Missing) 571
57.2%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
insecticidas 301
70.5%
fungicida+insecticida 95
 
22.2%
fungicidas 17
 
4.0%
desconocido 11
 
2.6%
otro 3
 
0.7%

Most occurring characters

ValueCountFrequency (%)
i 1027
17.4%
c 926
15.7%
s 725
12.3%
n 519
8.8%
d 519
8.8%
a 508
8.6%
e 407
 
6.9%
t 399
 
6.8%
I 396
 
6.7%
F 112
 
1.9%
Other values (7) 372
 
6.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 5293
89.6%
Uppercase Letter 522
 
8.8%
Math Symbol 95
 
1.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 1027
19.4%
c 926
17.5%
s 725
13.7%
n 519
9.8%
d 519
9.8%
a 508
9.6%
e 407
 
7.7%
t 399
 
7.5%
u 112
 
2.1%
g 112
 
2.1%
Other values (2) 39
 
0.7%
Uppercase Letter
ValueCountFrequency (%)
I 396
75.9%
F 112
 
21.5%
D 11
 
2.1%
O 3
 
0.6%
Math Symbol
ValueCountFrequency (%)
+ 95
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 5815
98.4%
Common 95
 
1.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 1027
17.7%
c 926
15.9%
s 725
12.5%
n 519
8.9%
d 519
8.9%
a 508
8.7%
e 407
 
7.0%
t 399
 
6.9%
I 396
 
6.8%
F 112
 
1.9%
Other values (6) 277
 
4.8%
Common
ValueCountFrequency (%)
+ 95
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5910
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 1027
17.4%
c 926
15.7%
s 725
12.3%
n 519
8.8%
d 519
8.8%
a 508
8.6%
e 407
 
6.9%
t 399
 
6.8%
I 396
 
6.7%
F 112
 
1.9%
Other values (7) 372
 
6.3%

TIPO_MATERIAL
Categorical

Distinct4
Distinct (%)0.4%
Missing1
Missing (%)0.1%
Memory size7.9 KiB
Hibrido
498 
OGM
479 
Variedad
 
11
Semilla Campesina
 
9

Length

Max length17
Median length8
Mean length5.1795386
Min length3

Characters and Unicode

Total characters5164
Distinct characters20
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowHibrido
2nd rowHibrido
3rd rowHibrido
4th rowHibrido
5th rowHibrido

Common Values

ValueCountFrequency (%)
Hibrido 498
49.9%
OGM 479
48.0%
Variedad 11
 
1.1%
Semilla Campesina 9
 
0.9%
(Missing) 1
 
0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
hibrido 498
49.5%
ogm 479
47.6%
variedad 11
 
1.1%
semilla 9
 
0.9%
campesina 9
 
0.9%

Most occurring characters

ValueCountFrequency (%)
i 1025
19.8%
d 520
10.1%
r 509
9.9%
H 498
9.6%
b 498
9.6%
o 498
9.6%
O 479
9.3%
G 479
9.3%
M 479
9.3%
a 49
 
0.9%
Other values (10) 130
 
2.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3191
61.8%
Uppercase Letter 1964
38.0%
Space Separator 9
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 1025
32.1%
d 520
16.3%
r 509
16.0%
b 498
15.6%
o 498
15.6%
a 49
 
1.5%
e 29
 
0.9%
m 18
 
0.6%
l 18
 
0.6%
p 9
 
0.3%
Other values (2) 18
 
0.6%
Uppercase Letter
ValueCountFrequency (%)
H 498
25.4%
O 479
24.4%
G 479
24.4%
M 479
24.4%
V 11
 
0.6%
S 9
 
0.5%
C 9
 
0.5%
Space Separator
ValueCountFrequency (%)
9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 5155
99.8%
Common 9
 
0.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 1025
19.9%
d 520
10.1%
r 509
9.9%
H 498
9.7%
b 498
9.7%
o 498
9.7%
O 479
9.3%
G 479
9.3%
M 479
9.3%
a 49
 
1.0%
Other values (9) 121
 
2.3%
Common
ValueCountFrequency (%)
9
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5164
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 1025
19.8%
d 520
10.1%
r 509
9.9%
H 498
9.6%
b 498
9.6%
o 498
9.6%
O 479
9.3%
G 479
9.3%
M 479
9.3%
a 49
 
0.9%
Other values (10) 130
 
2.5%

NUEVO_MATERIAL_GENETICO
Categorical

HIGH CARDINALITY  MISSING 

Distinct82
Distinct (%)28.7%
Missing712
Missing (%)71.3%
Memory size7.9 KiB
SV 1035
24 
SV-1035
23 
P 3966 WH
21 
DK 234 VTPRO
21 
DK 234 VTPro
19 
Other values (77)
178 

Length

Max length26
Median length23
Mean length8.9965035
Min length4

Characters and Unicode

Total characters2573
Distinct characters52
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)16.8%

Sample

1st row7019
2nd row7019
3rd row1430
4th rowSV 8726
5th rowSV-1035

Common Values

ValueCountFrequency (%)
SV 1035 24
 
2.4%
SV-1035 23
 
2.3%
P 3966 WH 21
 
2.1%
DK 234 VTPRO 21
 
2.1%
DK 234 VTPro 19
 
1.9%
SV 7019 12
 
1.2%
P 4082 WH 9
 
0.9%
SV - 1035 8
 
0.8%
DK 234 VT PRO 8
 
0.8%
SV-7019 7
 
0.7%
Other values (72) 134
 
13.4%
(Missing) 712
71.3%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
sv 63
 
10.3%
dk 55
 
9.0%
234 55
 
9.0%
1035 50
 
8.2%
wh 48
 
7.9%
vtpro 47
 
7.7%
p 35
 
5.7%
3966 24
 
3.9%
sv-1035 23
 
3.8%
7019 20
 
3.3%
Other values (63) 190
31.1%

Most occurring characters

ValueCountFrequency (%)
326
 
12.7%
3 219
 
8.5%
V 149
 
5.8%
0 141
 
5.5%
P 127
 
4.9%
1 109
 
4.2%
S 94
 
3.7%
2 94
 
3.7%
9 94
 
3.7%
4 91
 
3.5%
Other values (42) 1129
43.9%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 994
38.6%
Decimal Number 963
37.4%
Space Separator 326
 
12.7%
Lowercase Letter 232
 
9.0%
Dash Punctuation 58
 
2.3%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
V 149
15.0%
P 127
12.8%
S 94
9.5%
T 90
9.1%
D 72
7.2%
R 71
7.1%
O 59
 
5.9%
W 57
 
5.7%
H 56
 
5.6%
K 56
 
5.6%
Other values (11) 163
16.4%
Lowercase Letter
ValueCountFrequency (%)
r 39
16.8%
o 38
16.4%
i 20
8.6%
a 20
8.6%
s 17
7.3%
c 16
6.9%
p 15
 
6.5%
t 13
 
5.6%
l 12
 
5.2%
e 10
 
4.3%
Other values (9) 32
13.8%
Decimal Number
ValueCountFrequency (%)
3 219
22.7%
0 141
14.6%
1 109
11.3%
2 94
9.8%
9 94
9.8%
4 91
9.4%
5 83
 
8.6%
6 67
 
7.0%
8 34
 
3.5%
7 31
 
3.2%
Space Separator
ValueCountFrequency (%)
326
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 58
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1347
52.4%
Latin 1226
47.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
V 149
 
12.2%
P 127
 
10.4%
S 94
 
7.7%
T 90
 
7.3%
D 72
 
5.9%
R 71
 
5.8%
O 59
 
4.8%
W 57
 
4.6%
H 56
 
4.6%
K 56
 
4.6%
Other values (30) 395
32.2%
Common
ValueCountFrequency (%)
326
24.2%
3 219
16.3%
0 141
10.5%
1 109
 
8.1%
2 94
 
7.0%
9 94
 
7.0%
4 91
 
6.8%
5 83
 
6.2%
6 67
 
5.0%
- 58
 
4.3%
Other values (2) 65
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2573
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
326
 
12.7%
3 219
 
8.5%
V 149
 
5.8%
0 141
 
5.5%
P 127
 
4.9%
1 109
 
4.2%
S 94
 
3.7%
2 94
 
3.7%
9 94
 
3.7%
4 91
 
3.5%
Other values (42) 1129
43.9%

OTRO_CULT_ANT
Categorical

MISSING  UNIFORM 

Distinct4
Distinct (%)100.0%
Missing994
Missing (%)99.6%
Memory size7.9 KiB
algodón y frijol
frijol y algodón
frijol y hortaliza
HORTALIZAS

Length

Max length19
Median length17
Mean length15.75
Min length10

Characters and Unicode

Total characters63
Distinct characters25
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)100.0%

Sample

1st rowalgodón y frijol
2nd rowfrijol y algodón
3rd rowfrijol y hortaliza
4th rowHORTALIZAS

Common Values

ValueCountFrequency (%)
algodón y frijol 1
 
0.1%
frijol y algodón 1
 
0.1%
frijol y hortaliza 1
 
0.1%
HORTALIZAS 1
 
0.1%
(Missing) 994
99.6%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
y 3
30.0%
frijol 3
30.0%
algodón 2
20.0%
hortaliza 1
 
10.0%
hortalizas 1
 
10.0%

Most occurring characters

ValueCountFrequency (%)
9
14.3%
l 6
 
9.5%
o 6
 
9.5%
a 4
 
6.3%
i 4
 
6.3%
r 4
 
6.3%
y 3
 
4.8%
f 3
 
4.8%
j 3
 
4.8%
n 2
 
3.2%
Other values (15) 19
30.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 44
69.8%
Uppercase Letter 10
 
15.9%
Space Separator 9
 
14.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
l 6
13.6%
o 6
13.6%
a 4
9.1%
i 4
9.1%
r 4
9.1%
y 3
6.8%
f 3
6.8%
j 3
6.8%
n 2
 
4.5%
ó 2
 
4.5%
Other values (5) 7
15.9%
Uppercase Letter
ValueCountFrequency (%)
A 2
20.0%
H 1
10.0%
O 1
10.0%
R 1
10.0%
T 1
10.0%
L 1
10.0%
I 1
10.0%
Z 1
10.0%
S 1
10.0%
Space Separator
ValueCountFrequency (%)
9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 54
85.7%
Common 9
 
14.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
l 6
 
11.1%
o 6
 
11.1%
a 4
 
7.4%
i 4
 
7.4%
r 4
 
7.4%
y 3
 
5.6%
f 3
 
5.6%
j 3
 
5.6%
n 2
 
3.7%
ó 2
 
3.7%
Other values (14) 17
31.5%
Common
ValueCountFrequency (%)
9
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 61
96.8%
None 2
 
3.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9
14.8%
l 6
 
9.8%
o 6
 
9.8%
a 4
 
6.6%
i 4
 
6.6%
r 4
 
6.6%
y 3
 
4.9%
f 3
 
4.9%
j 3
 
4.9%
n 2
 
3.3%
Other values (14) 17
27.9%
None
ValueCountFrequency (%)
ó 2
100.0%

RESIEMBRA
Real number (ℝ)

MISSING  ZEROS 

Distinct8
Distinct (%)3.7%
Missing781
Missing (%)78.3%
Infinite0
Infinite (%)0.0%
Mean0.87557604
Minimum0
Maximum30
Zeros123
Zeros (%)12.3%
Negative0
Negative (%)0.0%
Memory size7.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile2.2
Maximum30
Range30
Interquartile range (IQR)1

Descriptive statistics

Standard deviation2.6155169
Coefficient of variation (CV)2.9871956
Kurtosis84.533622
Mean0.87557604
Median Absolute Deviation (MAD)0
Skewness8.5138422
Sum190
Variance6.8409285
MonotonicityNot monotonic
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
0 123
 
12.3%
1 64
 
6.4%
2 19
 
1.9%
3 6
 
0.6%
5 2
 
0.2%
30 1
 
0.1%
10 1
 
0.1%
20 1
 
0.1%
(Missing) 781
78.3%
ValueCountFrequency (%)
0 123
12.3%
1 64
6.4%
2 19
 
1.9%
3 6
 
0.6%
5 2
 
0.2%
10 1
 
0.1%
20 1
 
0.1%
30 1
 
0.1%
ValueCountFrequency (%)
30 1
 
0.1%
20 1
 
0.1%
10 1
 
0.1%
5 2
 
0.2%
3 6
 
0.6%
2 19
 
1.9%
1 64
6.4%
0 123
12.3%

CANTIDAD_TOTAL
Real number (ℝ)

MISSING  SKEWED 

Distinct442
Distinct (%)46.6%
Missing50
Missing (%)5.0%
Infinite0
Infinite (%)0.0%
Mean31145.689
Minimum1200
Maximum3260400
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB

Quantile statistics

Minimum1200
5-th percentile3200
Q15700
median11020
Q323212.5
95-th percentile96000
Maximum3260400
Range3259200
Interquartile range (IQR)17512.5

Descriptive statistics

Standard deviation122193.64
Coefficient of variation (CV)3.9232921
Kurtosis522.1236
Mean31145.689
Median Absolute Deviation (MAD)6220
Skewness20.587047
Sum29526113
Variance1.4931285 × 1010
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4500 21
 
2.1%
4000 21
 
2.1%
12000 14
 
1.4%
6000 14
 
1.4%
4800 14
 
1.4%
5000 12
 
1.2%
11000 12
 
1.2%
3500 11
 
1.1%
5700 11
 
1.1%
10000 10
 
1.0%
Other values (432) 808
81.0%
(Missing) 50
 
5.0%
ValueCountFrequency (%)
1200 3
0.3%
1600 1
 
0.1%
1700 1
 
0.1%
1800 5
0.5%
2000 5
0.5%
2100 2
 
0.2%
2200 5
0.5%
2300 2
 
0.2%
2400 4
0.4%
2500 4
0.4%
ValueCountFrequency (%)
3260400 1
0.1%
1066640 1
0.1%
540000 1
0.1%
518000 1
0.1%
510000 1
0.1%
406000 1
0.1%
382250 1
0.1%
375000 1
0.1%
315000 1
0.1%
302400 1
0.1%

HUMEDAD
Real number (ℝ)

Distinct17
Distinct (%)1.9%
Missing113
Missing (%)11.3%
Infinite0
Infinite (%)0.0%
Mean18.952655
Minimum14
Maximum34
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB

Quantile statistics

Minimum14
5-th percentile17
Q118
median19
Q320
95-th percentile22
Maximum34
Range20
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.5167713
Coefficient of variation (CV)0.08002949
Kurtosis11.733087
Mean18.952655
Median Absolute Deviation (MAD)1
Skewness1.6732262
Sum16773.1
Variance2.3005953
MonotonicityNot monotonic
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
18 265
26.6%
19 232
23.2%
20 188
18.8%
17 96
 
9.6%
22 40
 
4.0%
21 37
 
3.7%
23 6
 
0.6%
16 6
 
0.6%
24 3
 
0.3%
25 2
 
0.2%
Other values (7) 10
 
1.0%
(Missing) 113
11.3%
ValueCountFrequency (%)
14 2
 
0.2%
15 2
 
0.2%
15.5 2
 
0.2%
16 6
 
0.6%
17 96
 
9.6%
18 265
26.6%
18.5 1
 
0.1%
19 232
23.2%
20 188
18.8%
21 37
 
3.7%
ValueCountFrequency (%)
34 1
 
0.1%
26 1
 
0.1%
25 2
 
0.2%
24 3
 
0.3%
23 6
 
0.6%
22.6 1
 
0.1%
22 40
 
4.0%
21 37
 
3.7%
20 188
18.8%
19 232
23.2%
Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
NO
759 
SI
239 

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters1996
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNO
2nd rowNO
3rd rowNO
4th rowNO
5th rowNO

Common Values

ValueCountFrequency (%)
NO 759
76.1%
SI 239
 
23.9%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
no 759
76.1%
si 239
 
23.9%

Most occurring characters

ValueCountFrequency (%)
N 759
38.0%
O 759
38.0%
S 239
 
12.0%
I 239
 
12.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 1996
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
N 759
38.0%
O 759
38.0%
S 239
 
12.0%
I 239
 
12.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1996
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
N 759
38.0%
O 759
38.0%
S 239
 
12.0%
I 239
 
12.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1996
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
N 759
38.0%
O 759
38.0%
S 239
 
12.0%
I 239
 
12.0%

OBSERVACIONES_COSECHA
Categorical

HIGH CARDINALITY  MISSING 

Distinct132
Distinct (%)44.7%
Missing703
Missing (%)70.4%
Memory size7.9 KiB
estres hidrico
35 
déficit de lluvia
29 
VERANO
24 
estrés hidrico
19 
El verano afecto el rendimiento
 
10
Other values (127)
178 

Length

Max length200
Median length109
Mean length30.677966
Min length6

Characters and Unicode

Total characters9050
Distinct characters66
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)36.6%

Sample

1st rowPRODUCCIÓN AFECTADA POR SEQUÍA Y VOLCAMIENTO POR FUERTES VIENTOS
2nd rowPRODUCCIÓN AFECTADA POR SEQUÍA
3rd rowPRODUCCIÓN AFECTADA POR SEQUÍA
4th rowPRODUCCIÓN AFECTADA POR SEQUÍA EN LA ZONA
5th rowel intenso verano que se presento en el mes de junio

Common Values

ValueCountFrequency (%)
estres hidrico 35
 
3.5%
déficit de lluvia 29
 
2.9%
VERANO 24
 
2.4%
estrés hidrico 19
 
1.9%
El verano afecto el rendimiento 10
 
1.0%
Verano afecto produccion 9
 
0.9%
El verano afecto rendimiento 7
 
0.7%
estrés hidrico. 7
 
0.7%
El verano afecto los rendimientos 6
 
0.6%
El Verano 5
 
0.5%
Other values (122) 144
 
14.4%
(Missing) 703
70.4%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
verano 136
 
9.1%
el 120
 
8.1%
hidrico 84
 
5.6%
de 74
 
5.0%
afecto 63
 
4.2%
la 49
 
3.3%
en 48
 
3.2%
estrés 46
 
3.1%
rendimiento 39
 
2.6%
estres 38
 
2.6%
Other values (241) 791
53.2%

Most occurring characters

ValueCountFrequency (%)
1245
 
13.8%
e 779
 
8.6%
o 574
 
6.3%
i 503
 
5.6%
a 472
 
5.2%
r 455
 
5.0%
n 420
 
4.6%
t 346
 
3.8%
l 333
 
3.7%
s 333
 
3.7%
Other values (56) 3590
39.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 5801
64.1%
Uppercase Letter 1922
 
21.2%
Space Separator 1245
 
13.8%
Other Punctuation 71
 
0.8%
Decimal Number 7
 
0.1%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 779
13.4%
o 574
9.9%
i 503
 
8.7%
a 472
 
8.1%
r 455
 
7.8%
n 420
 
7.2%
t 346
 
6.0%
l 333
 
5.7%
s 333
 
5.7%
c 315
 
5.4%
Other values (20) 1271
21.9%
Uppercase Letter
ValueCountFrequency (%)
E 301
15.7%
A 211
11.0%
O 199
10.4%
N 160
 
8.3%
R 148
 
7.7%
I 106
 
5.5%
L 86
 
4.5%
T 84
 
4.4%
D 83
 
4.3%
C 82
 
4.3%
Other values (18) 462
24.0%
Decimal Number
ValueCountFrequency (%)
2 3
42.9%
0 3
42.9%
1 1
 
14.3%
Other Punctuation
ValueCountFrequency (%)
. 51
71.8%
, 20
 
28.2%
Space Separator
ValueCountFrequency (%)
1245
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 7723
85.3%
Common 1327
 
14.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 779
 
10.1%
o 574
 
7.4%
i 503
 
6.5%
a 472
 
6.1%
r 455
 
5.9%
n 420
 
5.4%
t 346
 
4.5%
l 333
 
4.3%
s 333
 
4.3%
c 315
 
4.1%
Other values (48) 3193
41.3%
Common
ValueCountFrequency (%)
1245
93.8%
. 51
 
3.8%
, 20
 
1.5%
2 3
 
0.2%
0 3
 
0.2%
( 2
 
0.2%
) 2
 
0.2%
1 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8909
98.4%
None 141
 
1.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1245
 
14.0%
e 779
 
8.7%
o 574
 
6.4%
i 503
 
5.6%
a 472
 
5.3%
r 455
 
5.1%
n 420
 
4.7%
t 346
 
3.9%
l 333
 
3.7%
s 333
 
3.7%
Other values (45) 3449
38.7%
None
ValueCountFrequency (%)
é 79
56.0%
ó 25
 
17.7%
í 8
 
5.7%
Ó 7
 
5.0%
É 6
 
4.3%
ñ 5
 
3.5%
Í 5
 
3.5%
á 3
 
2.1%
Ñ 1
 
0.7%
Ú 1
 
0.7%

DEPARTAMENTO
Categorical

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
CÓRDOBA
998 

Length

Max length7
Median length7
Mean length7
Min length7

Characters and Unicode

Total characters6986
Distinct characters7
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCÓRDOBA
2nd rowCÓRDOBA
3rd rowCÓRDOBA
4th rowCÓRDOBA
5th rowCÓRDOBA

Common Values

ValueCountFrequency (%)
CÓRDOBA 998
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
córdoba 998
100.0%

Most occurring characters

ValueCountFrequency (%)
C 998
14.3%
Ó 998
14.3%
R 998
14.3%
D 998
14.3%
O 998
14.3%
B 998
14.3%
A 998
14.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 6986
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
C 998
14.3%
Ó 998
14.3%
R 998
14.3%
D 998
14.3%
O 998
14.3%
B 998
14.3%
A 998
14.3%

Most occurring scripts

ValueCountFrequency (%)
Latin 6986
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
C 998
14.3%
Ó 998
14.3%
R 998
14.3%
D 998
14.3%
O 998
14.3%
B 998
14.3%
A 998
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5988
85.7%
None 998
 
14.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
C 998
16.7%
R 998
16.7%
D 998
16.7%
O 998
16.7%
B 998
16.7%
A 998
16.7%
None
ValueCountFrequency (%)
Ó 998
100.0%

MUNICIPIO
Categorical

Distinct13
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
COTORRA
204 
CERETÉ
198 
LORICA
145 
CHIMÁ
128 
CIÉNAGA DE ORO
91 
Other values (8)
232 

Length

Max length25
Median length14
Mean length7.5941884
Min length5

Characters and Unicode

Total characters7579
Distinct characters24
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCERETÉ
2nd rowCERETÉ
3rd rowCERETÉ
4th rowCERETÉ
5th rowCERETÉ

Common Values

ValueCountFrequency (%)
COTORRA 204
20.4%
CERETÉ 198
19.8%
LORICA 145
14.5%
CHIMÁ 128
12.8%
CIÉNAGA DE ORO 91
9.1%
SAN PELAYO 76
 
7.6%
SAN CARLOS 59
 
5.9%
MONTERÍA 50
 
5.0%
CHINÚ 22
 
2.2%
VALENCIA 14
 
1.4%
Other values (3) 11
 
1.1%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
cotorra 204
15.3%
cereté 198
14.8%
lorica 145
10.9%
san 135
10.1%
chimá 128
9.6%
de 98
7.3%
ciénaga 91
6.8%
oro 91
6.8%
pelayo 76
 
5.7%
carlos 59
 
4.4%
Other values (8) 111
8.3%

Most occurring characters

ValueCountFrequency (%)
R 962
12.7%
O 927
12.2%
A 901
11.9%
C 882
11.6%
E 643
8.5%
T 456
 
6.0%
I 416
 
5.5%
338
 
4.5%
N 328
 
4.3%
L 303
 
4.0%
Other values (14) 1423
18.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 7241
95.5%
Space Separator 338
 
4.5%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
R 962
13.3%
O 927
12.8%
A 901
12.4%
C 882
12.2%
E 643
8.9%
T 456
6.3%
I 416
 
5.7%
N 328
 
4.5%
L 303
 
4.2%
É 289
 
4.0%
Other values (13) 1134
15.7%
Space Separator
ValueCountFrequency (%)
338
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 7241
95.5%
Common 338
 
4.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
R 962
13.3%
O 927
12.8%
A 901
12.4%
C 882
12.2%
E 643
8.9%
T 456
6.3%
I 416
 
5.7%
N 328
 
4.5%
L 303
 
4.2%
É 289
 
4.0%
Other values (13) 1134
15.7%
Common
ValueCountFrequency (%)
338
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7074
93.3%
None 505
 
6.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
R 962
13.6%
O 927
13.1%
A 901
12.7%
C 882
12.5%
E 643
9.1%
T 456
6.4%
I 416
5.9%
338
 
4.8%
N 328
 
4.6%
L 303
 
4.3%
Other values (9) 918
13.0%
None
ValueCountFrequency (%)
É 289
57.2%
Á 128
25.3%
Í 57
 
11.3%
Ú 24
 
4.8%
Ó 7
 
1.4%

AREA
Real number (ℝ)

Distinct101
Distinct (%)10.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.0477956
Minimum0.5
Maximum71.5
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB

Quantile statistics

Minimum0.5
5-th percentile1
Q11.1
median2
Q34
95-th percentile14.12
Maximum71.5
Range71
Interquartile range (IQR)2.9

Descriptive statistics

Standard deviation5.9282271
Coefficient of variation (CV)1.4645569
Kurtosis35.052128
Mean4.0477956
Median Absolute Deviation (MAD)1
Skewness4.9134087
Sum4039.7
Variance35.143877
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 240
24.0%
2 143
14.3%
3 81
 
8.1%
1.5 65
 
6.5%
4 52
 
5.2%
5 27
 
2.7%
2.5 26
 
2.6%
1.3 24
 
2.4%
1.2 22
 
2.2%
6 22
 
2.2%
Other values (91) 296
29.7%
ValueCountFrequency (%)
0.5 1
 
0.1%
0.7 1
 
0.1%
0.9 2
 
0.2%
1 240
24.0%
1.1 10
 
1.0%
1.2 22
 
2.2%
1.25 2
 
0.2%
1.3 24
 
2.4%
1.4 7
 
0.7%
1.5 65
 
6.5%
ValueCountFrequency (%)
71.5 1
0.1%
55 1
0.1%
54 1
0.1%
44.2 1
0.1%
42 1
0.1%
40 1
0.1%
37.2 1
0.1%
35 1
0.1%
31 1
0.1%
30 1
0.1%

Interactions

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

ID_EVENTOID_LOTEID_FINCAID_PRODLAT_LOTELONG_LOTEFECHA_SIEMBRATIPO_SIEMBRANUM_SEMILLASSEM_TRATADASDIST_SURCOSDIST_PLANTASTIPO_CULTIVOCOLOR_ENDOSPERMOSEM_POR_SITIOTIPO_DE_SEMILLAHABITO_CRECIMIENTOMATERIAL_GENETICOOBJ_RDTCULT_ANTDRENAJEFECHA_EMERGENCIAPOBLACION_20DIASFECHA_FLORACIONFECHA_COSECHAMETODO_COSECHARDTPROD_COSECHADONOMBRE_LOTEORIGEN_SEMILLAINOCULACION_SEMILLASNUEVA_INOCULACION_SEMILLASPRODUCTO_USADOTIPO_MATERIALNUEVO_MATERIAL_GENETICOOTRO_CULT_ANTRESIEMBRACANTIDAD_TOTALHUMEDADALMACENAMIENTO_FINCAOBSERVACIONES_COSECHADEPARTAMENTOMUNICIPIOAREA
0534042138.877222-75.7644445/13/2013Mecanizado60000.0NO0.80.2MaizBlanco2.0NaNNaNPIONEER 30F325000.0AlgodónSI5/18/201360000.07/20/20139/26/2013Manual5000.0Grano secoVILLA GABRIELANaNNaNNaNNaNHibridoNaNNaNNaN5000.018.0NONaNCÓRDOBACERETÉ1.0
1544343148.879167-75.7655565/2/2013Mecanizado60000.0SI0.80.2MaizBlanco1.0NaNNaNDK 2345.0MaizSI5/7/201360000.07/10/20139/11/2013Manual5000.0Grano secoVILLA LOURDESNaNNaNNaNInsecticidasHibridoNaNNaNNaN5000.020.0NONaNCÓRDOBACERETÉ1.0
2564444158.880000-75.7658335/12/2013Mecanizado60000.0NO0.80.2MaizBlanco1.0NaNNaNPIONEER 30F325.0AlgodónSI5/17/201360000.07/15/20139/19/2013Manual5500.0Grano secoSANTA MARTANaNNaNNaNNaNHibridoNaNNaNNaN5500.019.0NONaNCÓRDOBACERETÉ1.0
3574545168.878611-75.7588895/7/2013Mecanizado60000.0NO0.80.2MaizBlanco1.0NaNNaNOtro5.0AlgodónSI5/12/201360000.07/15/20139/12/2013Manual5200.0Grano secoPALMARNaNNaNNaNNaNHibrido7019NaNNaN5200.019.0NONaNCÓRDOBACERETÉ1.0
42734646178.876389-75.7591675/7/2013Mecanizado60000.0NO0.80.2MaizBlanco1.0NaNNaNOtro5000.0AlgodónSI5/12/201360000.07/14/20139/12/2013Manual5700.0Grano secoDONDE FIDELNaNNaNNaNNaNHibrido7019NaNNaN5700.020.0NONaNCÓRDOBACERETÉ1.0
52824747188.847500-75.7666675/8/2013Mecanizado60000.0SI0.80.2MaizBlanco2.0NaNNaNDK 2345.0MaizSI5/14/201360000.07/25/20139/18/2013Manual5200.0Grano secoVILLA LUZNaNNaNNaNInsecticidasHibridoNaNNaNNaN5200.018.0NONaNCÓRDOBACERETÉ1.0
62835150228.847778-75.7675005/2/2013Manual60000.0SI0.80.2MaizBlanco3.0NaNNaNOtro5.0MaizSI5/7/201360000.06/23/20139/17/2013Manual4700.0Grano secoVILLA MARIANaNNaNNaNInsecticidasHibrido1430NaNNaN4700.019.0NONaNCÓRDOBACERETÉ1.0
72842682402668.853056-75.7577785/9/2013Manual60000.0SI0.80.2MaizBlanco3.0NaNNaNPIONEER 30F325.0AlgodónSI5/14/201360000.06/25/20139/22/2013Manual6000.0Grano secoDONDE TIBERIONaNNaNNaNInsecticidasHibridoNaNNaNNaN6000.017.0NONaNCÓRDOBACERETÉ1.0
82862692412678.852500-75.7572225/3/2013Manual60000.0SI0.80.2MaizBlanco2.0NaNNaNDK 2345.0AlgodónSI5/9/201360000.06/20/20139/14/2013Manual5500.0Grano secoDONDE PINTONaNNaNNaNInsecticidasHibridoNaNNaNNaN5500.020.0NONaNCÓRDOBACERETÉ1.0
92872702422688.848889-75.7647225/26/2013Mecanizado60000.0NO0.80.2MaizAmarillo1.0NaNNaNPIONEER 30F355.0FrijolSI5/31/201360000.07/26/201310/5/2013Manual4500.0Grano secoVILLA CELIANaNNaNNaNNaNHibridoNaNNaNNaN4500.019.0NONaNCÓRDOBACERETÉ1.0
ID_EVENTOID_LOTEID_FINCAID_PRODLAT_LOTELONG_LOTEFECHA_SIEMBRATIPO_SIEMBRANUM_SEMILLASSEM_TRATADASDIST_SURCOSDIST_PLANTASTIPO_CULTIVOCOLOR_ENDOSPERMOSEM_POR_SITIOTIPO_DE_SEMILLAHABITO_CRECIMIENTOMATERIAL_GENETICOOBJ_RDTCULT_ANTDRENAJEFECHA_EMERGENCIAPOBLACION_20DIASFECHA_FLORACIONFECHA_COSECHAMETODO_COSECHARDTPROD_COSECHADONOMBRE_LOTEORIGEN_SEMILLAINOCULACION_SEMILLASNUEVA_INOCULACION_SEMILLASPRODUCTO_USADOTIPO_MATERIALNUEVO_MATERIAL_GENETICOOTRO_CULT_ANTRESIEMBRACANTIDAD_TOTALHUMEDADALMACENAMIENTO_FINCAOBSERVACIONES_COSECHADEPARTAMENTOMUNICIPIOAREA
98846664165439147189.094833-75.7831865/23/2016Mecanizado18.0NO0.80.20MaizAmarillo1.0NaNNaNOtro6100.0AlgodónSI5/28/201659000.07/19/201610/8/2016Mecanizada6160.0Grano secoLAS MARIASNaNNaNNaNNaNOGMDK 7088 VTPRONaNNaN20910.018.0NONaNCÓRDOBACOTORRA3.4
98946674161438747169.075958-75.7213695/8/2016Mecanizado18.5NO0.80.18MaizAmarillo1.0NaNNaNADV 9293 (Syngenta)7200.0AlgodónSI5/14/201667000.07/4/20169/24/2016Manual7000.0Grano secoLA ESPERANZANaNNaNNaNNaNHibridoNaNNaNNaN32200.018.0NONaNCÓRDOBACHIMÁ4.6
99046684160438647169.073700-75.7207255/8/2016Mecanizado18.0NO0.80.18MaizAmarillo1.0NaNNaNOtro7800.0AlgodónSI5/13/201670000.07/2/20169/26/2016Mecanizada7000.0Grano secoLA COMPAÑIANaNNaNNaNNaNHibridoSV 1035NaNNaN30100.018.0NONaNCÓRDOBACHIMÁ4.3
99146694256448447529.076806-75.7266115/23/2016Mecanizado18.2NO0.80.20MaizAmarillo1.0NaNNaNOtro6780.0AlgodónSI5/28/201660000.07/17/201610/10/2016Mecanizada6500.0Grano secoLA ESPERANZANaNNaNNaNNaNHibridoADT-9293NaNNaN14300.018.0NONaNCÓRDOBACOTORRA2.2
99246704268449647569.086806-75.7359005/11/2016Manual17.5NO0.80.40MaizAmarillo3.0NaNNaNOtro7100.0MaizSI5/17/201661000.07/5/20169/22/2016Manual6980.0Grano secoEL CAMPANITONaNNaNNaNNaNHibridoADT-9339NaNNaN6980.017.0NONaNCÓRDOBACOTORRA1.0
99346714323454847879.018844-75.7585315/21/2016Manual17.5NO0.80.40MaizAmarillo3.0NaNNaNOtro5300.0MaizSI5/25/201664200.07/13/201610/4/2016Manual7150.0Grano secoSANTA MARTANaNNaNNaNNaNHibridoSV-1035NaNNaN7150.017.0NONaNCÓRDOBASAN PELAYO1.0
99446724322454747868.871503-75.7724115/14/2016Mecanizado19.2NO0.80.20MaizBlanco1.0NaNNaNOtro6980.0MaizSI5/20/201660000.07/8/20169/30/2016Mecanizada6200.0Grano secoSALSIPUEDESNaNNaNNaNNaNHibridoSV-7019NaNNaN13020.017.0NONaNCÓRDOBACERETÉ2.1
99546734321454647858.853453-75.7511925/7/2016Mecanizado17.4NO0.80.20MaizBlanco1.0NaNNaNOtroNaNAlgodónNO5/12/201659800.07/2/20169/23/2016Mecanizada6326.0Grano secoLA PALMERANaNNaNNaNNaNHibridoSV-7019NaNNaN12652.017.0NONaNCÓRDOBACERETÉ2.0
99646744320454547849.034286-75.7804585/9/2016Mecanizado18.5NO0.80.18MaizBlanco1.0NaNNaNDK 234 YGRR6520.0AlgodónSI5/15/201670000.07/3/20169/21/2016Mecanizada6600.0Grano secoTIGRENaNNaNNaNNaNOGMNaNNaNNaN14520.019.0NONaNCÓRDOBACOTORRA2.2
99746754319454447849.031350-75.7799535/9/2016Mecanizado17.8NO0.80.18MaizBlanco1.0NaNNaNP3966 (Pioneer)6400.0AlgodónSI5/14/201667600.07/5/20169/21/2016Mecanizada6440.0Grano secoCARACOLNaNNaNNaNNaNOGMNaNNaNNaN14812.018.0NONaNCÓRDOBACOTORRA2.3